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named mean-based training (MBT) is applied urgently to overcome 
the matching delay caused by the long feature vector. The last novelty aspect 
Keywords: is utilizing the one-to-one mapping relationship for fusing the edge-to-angle 
unimodal classification results into a multimodal system using the logical-OR rule. 


Decision fusion Despite some dataset difficulties like unconstrained facial images (UFI) 


Edge-angle S which includes varying illuminations, expressions, occlusions, and poses, 
Face recognition the multimodal system has highly improved the accuracy rate and achieved 
Optimization a promising recognition result, where the decision fusion is classified 
Phase congurency correctly (84, 92, and 72%) with only one training vector per MBT in contrast to 
(80, 62, and 68%) with five training vectors for normal matching. These results 
are measured by Eucledian, Manhattan, and Cosine distance measure respectively. 
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1. INTRODUCTION 

Phase congruency (PC) is a contrast and illumination invariant measure of valuable features. 
Unlike gradient-based feature detectors, which can only detect the first or second derivative features (also 
called step features), phase congruency correctly detects all kind of phase angle related features, and not just 
step features having O or 180 phase angle degrees. Although the complexity in calculating the PC features, 
it is considered an immune method against noise, contrast variations, and illumination. Also, the amount 
of redundant data is effectively reduced after extraction stage, which is an important process for building 
features that: speed-up learning process, phase generalization, and time saving. Generally, it is the step that 
classification process comes after. One of the biggest challenges in this work is the removing of the feature 
extraction stage for reducing the compuations complexity and considering only the detected PC features. 
These features (edge-angle) are considered here for constructing separately two face recognition unimodal 
systems: edge-based modal and angle-based modal. Then the recognition accuracies for both modals 
are computed for different databases, training methods, and classifiers to determine how much they would 
be improved after applying one-to-one or edge-to-angle mapping relationship and decision fusion technique. 
Next sections demonstrate three different approaches in finding PC formulations in term of 1-D, Gabor 
Wavelets, and 2-D Laplass of Gaussian and Gabor. 
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2. RELATED WORKS 

P. Kovesi was the most interested in the PC field since 1999, a calculation for 1-D signal was 
extended to 2-D images using wavelet high-pass filters to obtain image information at different scales where 
a universal threshold value over wide class of images was applied [1]. The advantages of using PC over 
Canny edge detectors was explained and approved using standard images [2]. PC-based face recognition 
technique for improving the recognition rates of the faces that are affected due to varying illuminations, 
partial occlusions, and varying expressions was proposed and implemented [3]. A descriptor based on the PC 
concept, called histogram of oriented phase (HOP) was applied to depict and represent the human objects 
more efficiently than the gradient based approach [4]. A modular kernel eigen spaces approach was 
implemented on the PC images to localize nonlinear feature selection procedure to help overcoming 
the bottlenecks of illumination variations, partial occlusions, and expression variations [5]. A 2-D multi-scale 
phase congruency (2D-MSPC) software for detecting and evaluation of image features could appropriately 
tune many parameters for optimal image features detection, these parameters are optimized for maximum and 
minimum moments [6]. A modified algorithm of PC to locate image features using Hilbert transform was 
implemented and the local energy was obtained by convoluting original image with two operators of 
removing direct current (DC) component over current window and 2-D Hilbert transform respectively. 
The local energy was divided with the sum of Fourier amplitude of current window to retrieve the value of 
PC [7]. A quality matric for evaluation of different video fusion methods using set of predefined 3D Gabor 
filters was employed to compute the spatial temporal PC for input and fused videos [8]. 

The local binary pattern (LBP) on the ultra sound (US) medical images after computing their PCs 
was used to improve the reliable feature point localization [9]. The multi-focus image fusion applied 
new fusion rule and complex Gabor wavelet to obtain the benefits of PC sharpness in finding new focus 
measure [10]. Two different multi-spectral image fusion rules for nonsubsampled contourlet transform 
(NSCT) was introduced with: PC, principle component analysis (PCA), directive contrast, and entropy for 
developing integrity model [11]. A fusion of the bio-medical images and guided filter was implemented by 
decomposing the bio-medical images into two sub-bands using the non-subsampled contourlet transform. 
The guided filter was very effective in extracted the high frequency details for obtaining the final fused 
image in spatial domain [12]. The edges in the liver image was enhanced by improving the segmentation 
accuracy. The sum modified Laplacian (SML), contrast features measure, and non-subsampled shearlet 
transform (NSST) were integrated to reconstruct a model for better fusion of edge information [13]. Pearson 
correlation coefficient and Spearman rank order correlation coefficient [14] were used to evaluate the 
performance of the proposed metric. The proposal developed a novel image quality and metric as a set of ten 
extracted features from each distorted image, then relevance vector machine algorithm (RVM) was used to 
learn the mapping between the human opinion scores and combined features. A novel descriptor using multi- 
oriented PC and magnitude information was presented to show the advantages of the histograms of oriented 
magnitude (HOM) and Histograms of oriented phase congruency (HPC) combination, also it confirmed that 
the histograms of oriented magnitude and phase congruency (HOMPC) was so superior to the local feature in 
the state-of-the-art descriptors. Three datasets were composed of multi-sensor remote sensing images 
obtained from satellite platforms, unmanned aerial vehicle, and unmanned ground vehicle [15]. An immune 
model of PC against illumination and contrast variation was developed. An obtained result from variety of 
remote sensing images showed an excellent performance against the local invariant and robust of the model 
to both geometric and radiometric changes [16]. The negatively impact was avoided using the same fusion 
rule on different scales, also a Gaussian filter with multi-scale decomposition (MSD) of total variation (TV) 
and PC were proposed for designing the modal [17]. 


3. PHASE CONGRUENCY MEASURES 

Three different methods are presented in the following sections to determine a new operator that 
emploies the phase congruency principle moments for computing the edge and angle information which is 
invariant to the image contrast, noise, and illumination. [18-20]: 


3.1. Fourier transform 

The fourier series is expanded at some location point, x to define the one-dimension phase 
congruency function. Features are detected by founding the Fourier components that has maximum phase, as 
in (1) [1, 2]. 


(1) 


Zul Fp, | cos(Ø, (x) = Ø, = 
du [Fpa | 


PC(x) = maxg (xe, 0,20] ( 
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Where: %.@) and %.@) are the local and mean phase angles of the frequency component Fp. at x 
respectively. The aim is to maximize Equestion (1) by maximizing the amplitude weighted mean for local 
phase angle for all considered Fourier points of %.G). Hereby, phase congruency is a rather difficult quantity 
to be computed, as finding where phase congruency is a maximum is approximately equivalent finding where 
the weighted variance of local phase angles, relative to the weighted average local phase, is a minimum [1, 2]. 


3.2. Wavelet transform 
The phase congruency in (1) is sensitive to the noise and it is not well localized because the measure 
changes with the difference in phase not in the small responses or FP. magnitude, so for small angle value 


agah ' . . . 
(converges to zero), the °° Coral /2 explains how phase difference effects the weighted magnitudes 


IFp.|. So, an alternative approach is needed to find a maximum local energy as phase congruency is directly 
proportional to it [2]. The phase congruency performance is improved with presence of noise. 
The convolution of a signal, /(x) with a set of cosine and sine Wavelets at scale m will deliver the even and 
odd components at the m” scale. The amplitude of this transformation is the local energy 4m@as explained in 
(2, 3, 4,5, 6 and 7) [21, 22]. The phase congruency measurement is defined in (8). 


Le, (X), Om (x)| = L(x) * ME I(x) * MP | (2) 


Am (x) = Vm (x)? + Om (x)? (3) 


where: 
I(x) : 1s the one-dimension signal, 
Min» Mm : are the cosine and sine Wavelets at the m” scale respectively, 
em (xX), Om (x) : are the even and odd components, 
Am (x) : 1s the local energy or the amplitude of transformed results, so the total energy A is: 
X An m Y Ven GF On CP (4) 
Let, F(x) ~ » em (x) (5) 
m 
and H(x) = J On (x) (6) 
m 
Where E(x) = ./F(x)? + H(x)? (7) 
E(x) 
PC(x) = ———— (8) 
din An (x) + € 


€: is a small value between O and 1 


3.3. Two-dimensional log-Gabor 

The extension of frequency domain considerations to 2D image was made up by convolving set of 
frequency domain constructed filters with an image. So, a construction of Fourier domain filter like 
log-Gabor function is very suitable as it has complementary spreading functions and singularity at DC 
frequency. Starting from the low-pass Gaussian filter g with K different orientations has a transfer function 
defined in (9) [18, 19, 21, 23, 24]. 





(80-0) 
aE E: 
g0, 0) = e 295 (9) 
V2TO, 
where, 
o ; ? are the orientation and angle of the local orientation respectively 
O; : controls the spreading around the orientation Oo 
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The Laplace of Gaussian log and Gabor band-pass filter g have spreading function /9 (w, Wm) with M 
different scales as defined in (10). 


0 W = 
- (log( w/ wm))? 
1g(@, wmn) = 2(log(£))? (10) 
1 
————c WF 
V 20, 
where, “ > “m: are the scale and the center frequency at this scale respectively, P : controls bandwidth at 
— k 
this scale, p= 4 Wm , the functions combination will result a 2D filter 12D which works at different 


scales and orientations as in (11). 
12Dg(w, wm 80,0 ) = g(00,0) xX Iglw, wm ) (11) 


so, the convolution of this filter with image /(x,y) delivers the phase congruency measurement at M scales 
after inversing the fourier transformation, 


E(mM)xy = 3! (I2Dg(w, wm 800 xy * Ixy (12) 
EME (m)l 

PCy = a l (13) 
M_E] +E 


4. UNIMODAL SYSTEMS DESIGN 

Two unimodal systems based face trait are designed, the first one is edge-based modal and 
the second one is angel-based modal. Figure 1 is an example of PC outputs applied on four dataset images 
with their edge-gradient and angle-gradient features, where the source code in [25] is applied with its 
experimental optimum parameters to compute the PC feature matrices. The tuned parameters include: no. of 
wavelet scales=4, no. of filter orientations=6, min Wavelength scale filter=3, ratio of the standard deviation 
of the Gaussian=0.55, filter orientations to the Gaussian standard deviation ratio =1.2, standard deviations of 
noise threshold point=2.0. The two modals experience serious challenge in the matching time causes by 
cancelling the feature extraction phase. Therefore, a new modification toward determining the mean vector 
for each class in the training datasets is turned on as explained in the next section. 





Figure 1. Phase congruency feature detection 
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5. MEAN-BASED TRAINING 

The MBT vector can be calculated directly during system running as explained in (14), For purpose 
of comparison and performance evaluation, matching stage is implemented in both modes: normal matching 
and MBT matching. Hereby, each unimodal system will output forty-eight classification results comes from: 
4 databases x 2 unimodals (edge and angle) x 2 training-modes (MBT and normal) x 3 classifiers, as detailed 
in Figure 2. The recognition rate is calculated at minimum equal error rate (EER) value that resultant from 
intersection of false acceptance rate (FAR) and false rejection rate (FRR) curves. The four databases 
specifications are tabulated in Table | for farther explaination. 


MBT = 


n > 
dino Vi (14) 
n 


> 
where, Yi : is a machine column vector, and n is the number of training vector in a class. 


Table 1. Face databases descriptions 


Database Subjects Test image Training images Total samples Image dimension 
AR 50 1 5 250 120x165 
ORL 40 1 8 320 112x92 
VIS-CH 40 1 10 400 225x321 
UFI 50 1 5 250 128x128 


6. DECISION FUSION 

The edge-angle classification results of 180 subjects are fused together using the logical OR-gate as 
explained in Figure 2. This step is greatly improved the overall system performance as the experimental 
results approved that in Table 2 to Table 7. 


Templates ' Unimodal ' edge-to-angle | Multimodal ! 
Soares ' Classifiers | Fusion Accuracies 
Edge 
ED L J} -- > 
Angle 
Edge 
Mean-based MD h - -> 
Training 
- Angle 
Edge 
CD -- > 
Phase congruency ii 
features da. a 
(Four databases) Edge 
of ps 
Angle 
= Edge 
Norma 
MD = H Sui 
Angle 
Edge 
CD a --- >» 
Angle 


Figure 2. Edge-angle decision fusion construction using two training modes and three classifiers 


Table 2. MBT accuracy using edge, angle, and fusion methods with Euclidean classifier 


Databases = mRecursoya_JEuelidean = 
Edge Angle Fusion 
AR 79.6 96.2 98 
ORL 80.9 81.1 95 
VIS-TH 84.6 83.3 95 
UFI 57.6 53.8 84 


Optimized multimodal biometric system based fusion technique for human identification (Muthana H. Hamd) 


2416 O ISSN: 2302-9285 


Table 3. MBT accuracy using edge, angle, and fusion methods with Manhattan classifier 


Maximum Accuracy% Manhattan 
Database... FT 


Edge Angle Fusion 
AR 82.7 92.8 96 
ORL 81.5 82.6 92.5 
VIS-TH 85 80.9 95 
UFI Jh 55.8 92 


Table 4. MBT accuracy using edge, angle, and fusion methods with Cosine classifier 


Dia Maximum Accuracy% Cosine 
Edge Angle Fusion 
AR 94.6 91.5 98 
ORL 89.9 82.1 95 
VIS-TH 97 76.2 97.5 
UFI 59.4 57.4 72 


Table 5. Normal accuracy using edge, angle, and fusion methods with Euclidean classifier 


Database —Maimum Accuracy% Euclidean _ 
Edge Angle Fusion 
AR 83.8 88.9 98 
ORL 85.5 78.4 95 
VIS-TH 84.3 86.2 97 
UFI 52 58.7 80 


Table 6. Normal accuracy using edge, angle, and fusion methods with Manhattan classifier 
Maximum Accuracy% Manhattan 


Database Edge Angle Fusion 
AR 88.3 96 98 
ORL 85.8 91.8 O75 

VIS-TH 87.6 93.6 97 
UFI 599 59.4 62 


Table 7. Normal accuracy using edge, angle, and fusion methods with Cosine classifier 


Database Maximum Accuracy% Cosine 
Edge Angle Fusion 
AR 95.5 94.8 100 
ORL 89.3 90.1 92.5 
VIS-TH 94.7 90 97.5 
UFI 56.4 59.3 68 


7. RESULTS AND DISCUSSION 

The 180 individuals are classified into genuine or imposter according to many considerations such 
as: edge or angle feature, MBT or Normal training, and type of classiffier. Generally, the maximum 
performance results in Table 2 to Table 7 can be described and categorized in to unimodal-based MBT 
performance, unimodal-based Normal training performance, and deciaion fusion performance as follows: 


7.1. MBT training performance 

The Euclidean classifier based unimodal system satisfied 96.2% maximum accuracy while 
the decision fusion improved this rate to 98%. The increment percentage is 1.8. The Manhattan classifier 
based unimodal system satisfied 92.8% maximum accuracy while the decision fusion improved this rate to 
96%. The increment percentage is 1.2. The Cosine classifier based unimodal system satisfied 97% maximum 
accuracy while the decision fusion improved this rate to 98%. The increment percentage is 1. 


7.2. Normal training performance 

The Euclidean classifier based unimodal system satisfied 88.9% maximum accuracy while 
the decision fusion improved this rate to 98%. The increment percentage is 9.1. The Manhattan classifier 
based unimodal system satisfied 96% maximum accuracy while the decision fusion improved this rate to 
98%. The increment percentage is 2. The Cosine classifier based unimodal system satisfied 95.5% maximum 
accuracy while the decision fusion improved this rate to 100%. The increment percentage is 4.5. 
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Hereby, the MBT training mode has achieved better performance than Normal training mode when 
the Euclidean and Cosine measurements is applied. Furthermore, the decision fusion technique could 
improve the unimodal accuracies for all cases and from Table 7, it achieved 100% accuracy rate with 
increment percentage of 4.5 using: AR dataset, Normal training mode, and Cosine classification. In contrast 
to the high accuracy rates of AR dataset as explained in the Figures 3 and 4, the UFI dataset contains worst 
facial images that have different face representation and poses. From Table 3, the decision fusion technique 
achieved 92% accuracy rate using Manhattan classifier in contrast to 57.7% and 55.8% for edge and angle 
unimodal systems respectively. This means that, the decision fusion has improved the increment percentage 
by 34.3 according to the difference between the two maximum accuracies 92% and 57.7%. 
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Figure 3. Maximum accuracy of multimodal system Figure 4 Maximum accuracy of multimodal system 
using MBT training using normal training 


8. CONCLUSION 

This paper presented three new concepts in development of human identification system, based on 
the face recognition approaches. First, the regular feature extraction methods had not been employed within 
the design, therefore, the phase congruency was the only used approach as a facial feature’s detection. 
This step had an advantage in reducing the biometric system complexity and computation costs. Secondly, 
the large feature vector size that resulted from the phase congruency detection had now been appropriately 
resolved with the use of mean-based training technique. The recognition rates for this technique had 
advanced the normal training in most cases. At last, this work was the first to fuse the decision results for 
the two separable face features with edge-to-angle relationship. The decision fusion outputs highly improved 
the overall system performance, even for a low quality dataset such as the UFI. The multimodal system was 
able to accurately recognize 92% in contrast to 57.7% for a unimodal system, using mean-based training and 
Manhattan classifier. The new vision of those approved concepts can be a guidance for new developers in 
their future biometric system plans. 
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